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Abstract — The Transmission Control Protocol (TCP) was de- 
signed to provide reliable transport services in wired networks. 
In such networks, packet losses mainly occur due to congestion. 
Hence, TCP was designed to apply congestion avoidance tech- 
niques to cope with packet losses. Nowadays, TCP is also utilized 
in wireless networks where, besides congestion, numerous other 
reasons for packet losses exist. This results in reduced throughput 
and increased transmission round-trip time when the state of 
the wireless channel is bad. We propose a new network layer, 
that transparently sits below the transport layer and hides non 
congestion-imposed packet losses from TCP. The network coding 
in this new layer is based on the well-known class of Maximum 
Distance Separable (MDS) codes. 

I. Introduction 

Current versions of TCP 0J like TCP-CUBIC and 
Compound-TCP (5) use elaborate techniques to avoid and 
to cope with network congestion, which is the main reason 
for packet losses in wired networks. However, the increasing 
deployment of wireless networks like IEEE-802.11 (WLAN) 
or IEEE-802.16 (WiMAX) imposes new challenges on TCP 
as a reliable transport protocol. In wireless networks, packet 
losses can occur due to effects like shadowing, interference, 
and multipath propagation to mention only a few. The problem 
is, that TCP in all it's current implementations reacts on packet 
losses with some congestion avoidance strategy, thus reducing 
throughput and transmission round-trip time. 

During the last few years, network coding became a promi- 
nent field of interest in the communications- and coding 
community. The main idea is to view packets as elements of an 
algebraic structure, e.g. a finite field or a vector space. Then, 
packet losses correspond to erasures and there exist well- 
known algebraic techniques to correct them. The most im- 
portant example of this approach is probably random network 
coding [4| based on the Rank metric 0-Q, the corresponding 
codes are frequently called Rank- or Gabidulin codes (8)- 
|[T0l . In this paper we follow a different approach for network 
coding, first proposed by Kabatiansky and Krouk ifTTI and 
further developed and extended by Sidorenko et al. Ifl2l . It 
utilizes MDS codes and the traditional Hamming metric. With 
MDS codes, it is possible to correct errors and erasures, the 
tradeoff parameter between the two can be selected by the 
system designer. 

The idea of combining TCP with network coding was first 
presented by Sundararajan et al. Ifl3l . following the Rank 
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metric approach. We complement their work by following the 
Hamming metric approach and by introducing a new MDS 
Network Coding layer (NC layer) between the IP and TCP 
layer. The NC layer is transparent to the established layers 
which enables an easy deployment. 

The paper is organized as follows. In Section [TT] we explain 
MDS codes and show their most important property for net- 
work coding. In Section [III] we explain the NC layer together 
with algorithmic descriptions of the transmitter and receiver 
sides. The properties and capabilities of TCP with the NC layer 
are investigated in Section [TV] by simulation. We also analyze 
how network coding-capable nodes affect a mixed network 
environment. 

II. Maximum Distance Separable (MDS) Codes 

Let a := (ao, . . . , a n _i) and b := (bo, ... , &„_i) be two 
vectors over an extended Galois field F g with q = 2 m where 
m is a non-negative integer. The Hamming distance dn (a, b) 
is defined as the number of differing coordinates of a and b, 
i.e. 

dn(a, b) := \{i : a,i ^ bi, < i < n — 1}|. 

Let C be a subspace of F^ 1 with dimension k. Define the 
minimum distance (i m i n as 

dmin := min {d H (a, b)}. 

a.beC 

Then, C(q; n, k, d m ; n ) := C is called a q-nary linear block code 
of length n, dimension k and minimum distance e? m ; n . 

One basic result of Coding Theory is the Singleton Bound, 
which bounds the minimum distance for given code length and 
dimension. The following results were established in [14], see 
also 031. 



Theorem 1 (Singleton Bound) The minimum distance of a 
block code C(q; n, k, d mln ) is bounded by 

d m in <n— k + 1. 

Proof: By definition of e? m i n there are no two codewords 
in C with d m j n coinciding coordinates, but there might be 
codewords with <i m i n — 1 coinciding coordinates. Their quantity 
is at most q"- d mi„+i. The total number of codewords must be 
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smaller or equal than this quantity, hence q < q 

An interesting subset of the block codes are those, that have 
maximum possible minimum distance. 



Definition 1 (MDS Code) A block code C(q; n, k, d m i n ) is 
called Maximum Distance Separable (MDS) ;/ it fulfills the 
Singleton Bound with equality, i.e. 

drain = 71 — k + 1. 

It can be shown that there are no MDS codes over F2 
besides the repetition codes, the single parity check codes 
and the codes without any redundancy. Probably, the most 
important MDS codes are the Reed-Solomon (RS) codes fl31 . 
Ifl6l over ~F q with q > 2. In our considerations, we will always 
assume that q is a power of two, i.e. q = 2 m . In this case, 
Elements of F 9 can be represented by binary vectors of length 
m. We do not use any of the code's properties in this paper 
except that they are MDS. 

The core of our proposed scheme is the following well- 
known property of MDS codes. 

Theorem 2 If C(q;n,k, d m i n ) is an MDS code, then any k 
coordinates o/cdC unambiguously determine c. 

Proof: We have d m i n = n — k + 1, hence in any codeword 
at most k — 1 positions can coincide. Thus any k positions 
uniquely determine the codeword. ■ 
This allows the following procedure. Assume that a binary 
message of length kq bit should be transmitted. Let us denote 
a symbol from F g as a segment. The binary message can be 
considered as an information vector containing k segments. 
Theses segments can be encoded into n > k segments using 
an MDS code C(q; n, k, d m - m ). The resulting n segments are 
transmitted over a network. Then, by Theorem [2] the receiver 
is able to reconstruct the n segments if it receives arbitrary 
k segments. 

III. Network Coding Layer 

We are now ready to explain our proposed scheme for End- 
to-End algebraic network coding based on MDS codes. It 
works as a transparent new layer between the Internet and 
the Transport layer of the Internet protocol stack ifTTl , see 
Figure [T] Transparent means that the functionality of the 
NC layer is hidden to the existing layers, which simplifies 
deployment in existing networks. 
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Fig. 1. The Internet protocol stack extended by the NC layer. 

In our considerations, we identify the Transport layer with 
the TCP layer and the Internet Layer with the IP layer. For 
simplicity, we assume a half duplex communication, i.e that 
payload data is only transmitted in one direction. Further we 
assume that the connection setup and termination is always 



initiated by the transmitter side TCP instance. The generaliza- 
tion to full duplex is straightforward and does not yield any 
additional insights. 

Connection management segments like SYN or FIN usu- 
ally pass the NC layer in a transparent manner. However, 
such segments invoke some special connection management 
segment handling routines, see Section IIII-CI for details about 
connection management. 

At the transmitter, data segments arriving from the TCP are 
buffered until k segments are in the buffer. As soon as the 
buffer contains k segments (the information vector), they are 
encodecQ into n segments (the codeword) using an MDS code 
C(q; n, k, <i m i n ). The parameters q and k of the code must be 
chosen such that their product kq matches the data segment 
sizes arriving from the TCP. 

To cope with this restriction the NC instance has to change 
the Maximum Segment Size (MSS) parameter |fl8l offered by 
the receiving TCP instance during the connection negotiation 
If the transmitting TCP instance uses segment sizes less than 
the (modified) MSS, the NC layer has to pad the data. Another 
approach would be to implement a strategy for segmenting in 
the NC layer. For convenience we assume in the following 
that the TCP instance always uses segments with size equal 
to the MSS. 

After encoding, the n segments are passed to the IP layer. 
At the receiver, the NC layer acknowledges every received 
segment from the IP layer. It collects up to k segments in a 
buffer. This quantity is sufficient for decoding since C bears 
the MDS property. After decoding, the NC layer hands the 
decoded k segments (the segments which arrived from the 
TCP at the transmitter) to the receiver side TCP. 

Back at the transmitter, the NC layer counts the acknowl- 
edgments of coded segments. If their number is greater than 
some system parameter which we call the speculative ACK 
threshold, then the NC layer acknowledges all k data segments 
to the TCP. This is a speculative acknowledgment since at this 
time the receiver side TCP did not necessarily receive all the k 
data segments - some might still be stuck within the NC layer. 
However, the receiver side NC layer will be able to reconstruct 
the missing data segments from one of the following received 
codewords. Our simulations show that this happens very fast 
if the system parameters are chosen appropriately. 

The NC layer behavior at the transmitter and receiver is 
shown in Algorithms [T] and |2] respectively. Note that the proce- 
dures handle_TCP_handshake() and handle_TCP_teardown() 
are both capable of handling incoming SYN segments from 
TCP and IP. 

After the coarse description of the NC layer, we now focus 
on some details. The NC layer copies most of the TCP 
layer's functionalities, e.g. sequence numbers and the header 
structure. The following subsections deal with the relations, 
similarities and differences between the NC and the TCP layer. 



1 See Section IIII-AI for details about which parts of the TCP segments are 
encoded. 



Algorithm 1: Network Coding Layer at the Transmitter 

while true do 
buffer <— [ ] ; 
ACKcount <- 0; 
while | buffer | < k do 

receive TCP segment from TCP; 
if TCP segment is SYN then 
|_ handle_TCP_handshake(); 

else if TCP segment is FIN then 
handle_TCP_teardown(); 
reset NC layer; 

else if TCP segment is RST then 
forward segment to IP; 
reset NC layer; 

else 

put TCP segment into buffer; 

encode k TCP segments into n NC segments; 
hand over the n NC segments to IP; 
while ACKcount < s do 

receive NC acknowledgment from IP; 
ACKcount <r- ACKcount + 1 
acknowledge all buffered segments to TCP; 



Algorithm 2: Network Coding Layer at the Receiver 

while true do 
buffer <— [ ] ; 
while | buffer | < k do 

receive NC segment from IP; 
if NC segment is SYN then 
|_ handle_TCP_handshake(); 

else if NC segment is FIN then 
handle_TCP_teardown(); 
reset NC layer; 

else if NC segment is RST then 
forward segment to TCP; 
reset NC layer; 

else 

put NC segment into buffer; 
hand over NC acknowledgment to IP; 
decode k NC segments into n; 
extract k information segments, hand over to TCP; 
discard TCP acknowledgment; 



A. Header Structure 

The NC layer protocol reuses parts of the TCP header 
without any modification. To reduce protocol overhead, these 
common header parts can be stripped off the TCP header 
before encoding of the TCP segments. They can be easily 
reconstructed at the receiver side by simple extraction from 
the NC header. The common and stripped-off header parts 
are source port, destination port, all control flags except 
ACK, and sequence number. See notes about the latter one 



in Section UlI-BI The reused and stripped-off header fields are 
blue shaded in Figure [2] 

The remaining TCP header fields are not required by the 
NC layer. Thus, they are not part of the NC layer header 
and become part of the encoded TCP segment, i.e. the NC 
layer payload data. The TCP header fields which are encoded 
include acknowledgment number, offset, reserved, window, 
checksum, urgent pointer and options. They are non-shaded 
on the left-hand side of Figure [2] 

Besides the reused TCP header fields, the NC layer adds 
two additional header fields, i.e. symbol indicator (symb.) and 
NC options. The new fields are yellow shaded on the right- 
hand side of Figure [2] The symbol indicator is responsible to 
determine the position of a segment within an MDS codeword, 
for details see the following section. The NC options are not 
used in the current basic version of our protocol. They might 
be used to signal adaptions of code rate or speculative ACK 
threshold in later versions. 
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Fig. 2. Header structure of the TCP (left) and NC (right) layers. 



B. Sequence Numbers and Acknowledgments 

We have already mentioned the symbol indicator field in 
the previous section. In the TCP layer, the sequence number 
is responsible to order segments at the receiver side. The 
sequence number is incremented by one for each transmitted 
segment. Consider an information vector consisting of k TCP 
segments. The position of each TCP segment is uniquely 
determined by it's sequence number. But now the information 
vector is encoded into a codeword consisting of n segments. 
The NC layer should be able to uniquely determine the 
position of each segment, this is achieved by the symbol 
indicator field. 

The procedure is as follows: The first k segments of an 
MDS codeword simply copy the sequence numbers from TCP 
to their NC sequence number field. Their symbol indicator 
field is set to all zeros. The remaining n — k segments share 
the sequence number of the fc-th segment but their symbol 
indicator field is incremented by one for each segment. Simple 
concatenation of sequence number and symbol indicator pro- 
vides unique identification of each segment's position within 



an MDS codeword. More precisely, let p, denote the TCP 
sequence number and v the symbol indicator. Then the value 
2 8 /i + v uniquely determines the position of each segment. 

The length of the symbol indicator field is 8 bit, hence an 
MDS codeword can contain at most 256 redundancy segments. 
This bounds the minimal code rate by 

k k 1 



n k + (n 



> 



256 " 257' 



k) ~ k 

which seems to be sufficient for all practical applications. 

No TCP acknowledgments are transmitted over an NC 
enabled network. On the receiver side, any TCP acknowledg- 
ments are discarded immediately. On the transmitter side, the 
TCP acknowledgments are "transmitted" only between the NC 
and the TCP layers. Hence, the ACK flag is the only flag which 
is not reused and stripped off before encoding. It is overwritten 
by the NC layer and used for NC acknowledgment segments, 
i.e. NC layer segments with empty payload transmitted from 
the receiver to the transmitter. 

C. Connection Management 

Besides the change of the MSS, the NC layer does not 
introduce any new routines for connection setup and tear- 
down. This functions are inherited from the TCP layer. If 
any connection management segment is detected by the NC 
layer it starts the corresponding routine. In case of a TCP 
teardown or a connection reset, the NC layer schedules a 
self-reset immediately. The result of this is that the TCP and 
the NC layer share a common state for the full duration of a 
connection. 

IV. Simulation Results 

In this section we give simulation results to obtain some 
insights in the general behavior, advantages and disadvantages 
of the proposed End-to-End algebraic network coding based 
on MDS codes scheme. All simulations were done using the 
network simulator NS-2 |fl9l and a basic network topology 
with one bottleneck link, see Figure [3] Traffic is generated by 
two FTP sources Al and A2 which transmit to sinks SI and 
S2, respectively. The bottleneck in this setup is link N3— s>N4, 
with a reduced data rate of only 1 Mbit/s. NC layer segments 
are erased within the network with probability PER (packet 
erasure rate). 
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Fig. 3. Basic network topology for throughput and fairness simulations. 

For all simulations we have used an algebraic code with rate 
R = k/n = 1/2 and MDS property. We refrain from giving 
the detailed code parameters here since this section is devoted 
to the general behavior of the NC scheme. 



Figure |4] compares the achievable network throughput in 
TCP segments per second for a range of PER values. As 
expected, the NC-enabled TCP is outperformed by traditional 
TCP in channels with low PER. However, when the channel 
state deteriorates the traditional TCP throughput quickly falls 
below the NC-enabled version's throughput, which remains 
more or less constant over the full PER range. This gives rise 
to a natural extension of the NC scheme, which adapts the 
coding rate according to the current state of the channel. The 
NC options field in the NC layer header can be utilized for 
this purpose, see Section UlI-AI 
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Fig. 4. Throughput comparison between NC-enabled TCP and traditional 
TCP for a range of packet erasure probabilities. 

The throughput gain of NC-enabled TCP can be inter- 
preted as follows. In a traditional TCP environment, the 
protocol copes with increased PER by retransmission of non- 
acknowledged segments by adaptive repeat request (ARQ). 
This causes a significant amount of delay, especially when 
the PER is very high. The NC-enabled TCP's strategy to 
cope with increased PER is to send redundant segments in 
advance, such that the receiver is not required to wait for 
requested retransmissions even if segments were erased in the 
network. Depending on the round trip time and the selected 
code parameters, this enables the NC-enabled TCP to achieve 
a higher throughput than traditional TCP as shown by our 
simulations. 

Besides throughput, the most important performance mea- 
sure of the NC-enabled TCP is fairness, i.e. how fair available 
network capacities are distributed among the transmitting 
nodes of a network. We should distinguish between fairness 

• of the NC-enabled TCP against traditional TCP and 

• of a pure NC-enabled environment. 

The fairness simulation results for both cases are shown in 
Figure [5] and Figure [6] respectively. In both figures, two 
connections are active over the bottleneck link. In terms of 
Figure |3] this means that node Al is transmitting to node SI 
and node A2 is transmitting to node S2. 



In the NC/TCP case of Figure [5] it can be seen that NC- 
enabled TCP does not behave fair against traditional TCP. It 
dominates the bottleneck link and at any time instance achieves 
a higher throughput compared to traditional TCP. However, 
traditional TCP still can use a constant and non-zero share 
of the bottleneck's capacity, and hence gradual deployment of 
NC-enabled TCP is still feasible. 
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Fig. 5. Fairness comparison in a mixed environment where the traditional 
TCP transmission starts before the NC-enabled TCP transmission. 

In the NC/NC case of Figure |6j we observe that after a 
short balancing period, both NC-enabled TCP links receive 
approximately the same share of the bottleneck's capacity. 
Thus, our scheme can be considered as fair in a homogeneous 
environment. 
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Fig. 6. Fairness comparison in an NC-enabled TCP environment. 

For reference, we also give fairness simulation results for a 
pure traditional TCP environment in Figure [7] 
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Fig. 7. Fairness comparison in a traditional TCP environment. 



V. Conclusions 

In this paper we proposed a network coding scheme based 
on MDS codes for application in TCP/IP networks over 
wireless links. A new layer between the IP and TCP layers 
was introduced, which hides segment losses caused by the 
data link layer to the TCP. This is achieved by encoding a set 
of k TCP segments into a larger set n of encoded NC layer 
segments and transmitting the larger set. The receiver is then 
able to reconstruct all n segments by receiving an arbitrary 
subset of cardinality k by using the MDS property of the used 
code. 

By simulations it was shown that the throughput of a 
connection is superior compared to traditional TCP if the 
packet erasure probability is sufficiently large. 

Furthermore, the fairness between two TCP streams was 
studied. It turns out that an NC-enabled TCP stream dominates 
a traditional TCP stream. However, the fairness property is 
retained if both streams use NC-enabled TCP. 

The proposed network coding scheme rises up an abundance 
of new research questions. First, optimal code parameters 
should be derived for realistic settings, i.e. the product of sym- 
bol length and dimension should match the average segment 
size of TCP to avoid padding. Second, the scheme should be 
modified such that it can adapt to the channel conditions. It 
is counter-productive to apply encoding of segments when the 
packet erasure rate is small and in this case the coding rate 
bounds the achievable throughput. In an adaptive version of 
the scheme, the code rate might be adapted such that high 
rates (in the extremal case even code rate R = 1 which means 
no encoding at all) are used at low packet erasure rates and 
smaller code rates are used in the opposite case. Third, one 
might think about exploiting the error correction capabilities 
of the utilized MDS codes, i.e. to cope with maliciously 
introduced packet errors. 
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